A method of mapping a plurality of tasks and data onto a multiple processor, 
distributed memory hardware architecture, the method comprising: 

a) describing a task-level network of behaviors, each of the task-level 
network of behaviors being related through control and data flow; 

b) predicting a schedule of tasks for the task-level network of behaviors; and 

c) allocating the plurality of tasks and data to at least one of the multiple 
processors and to at least one of distributed memory, respectively, in 
response to the predicted schedule of tasks. 

The method of claim 1 wherein the predicting the schedule of tasks comprises 
minimizing execution time of the plurality of tasks. 

The method of claim 2 wherein the predicting the schedule of tasks comprises 
minimizing the schedule of tasks by allocating data to the distributed memories in 
order to minimize data transfers. 

The method of claim 1 wherein the predicting the schedule of tasks comprises 
maximizing parallel execution of the plurality of tasks on at least two processors 
of the multiple processors. 

The method of claim 1 wherein the allocating the plurality of tasks comprises 
allocating tasks to one of the multiple processors having optimal processor 
resources for the tasks. 

The method of claim 1 wherein the predicting the schedule of tasks comprises 
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2 using a resource-based model of the architecture to predict the schedule of tasks. 

1 7. The method of claim 1 wherein the predicting the schedule of tasks comprises 

2 using an interval graph and an execution time model of the task-level network of 

3 behaviors to predict the schedule of tasks. 

1 8 . The method of claim 1 wherein the allocating the plurality of tasks and data 

2 comprises an iterative allocation process. 

1 9. The method of claim 8 wherein the iterative allocation process comprises using a 

2 demand-driven and constraint-based objective function. 

1 10. The method of claim 1 wherein the describing a task-level network of behaviors 



S| 2 comprises describing a task-level network of behaviors in a high-level 



3 programming language. 

1 11. The method of claim 1 0 wherein the describing the task-level network of 



fu 

m 2 behaviors in the high-level programming language comprises parsing the high- 



|j 3 level programming language into an intermediate form. 

1 12. The method of claim 1 further comprising generating machine executable code 

2 for the multiple processor, distributed memory hardware architecture based at 

3 least in part on the allocating the plurality of tasks and data. 

1 13. The method of claim 1 wherein the allocating the plurality of data to the 

2 distributed memories comprises allocating data to shared memories. 

1 14. The method of claim 1 wherein the allocating the plurality of data to the 
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2 distributed memories comprises allocating data to private memories, 

1 15. A method for generating a control graph for a compiler used to map a plurality of 

2 tasks and data onto a multiple processor, distributed memory hardware 

3 architecture, the method comprising: 

4 a) parsing a plurality of tasks into an internal compiler form of 

5 interconnected task nodes; and 

6 b) linking a compiler representation to each interconnected task node using 

7 directed edges for the purpose of substantially simultaneously mapping the 
13 8 tasks to multiple processors in the multiple processor, distributed memory 
W 9 hardware architecture. 

M 
M 

[Jjj 1 16. The method of claim 1 5 further comprising binding the interconnected task nodes 

jg 2 to the directed edges. 

P ' 

jj| 1 17. The method of claim 1 5 further comprising parsing a plurality of data blocks into 

is 

pj 2 an internal compiler form of data nodes and linking a compiler representation to 

3 each data node using directed edges. 

1 18. The method of claim 1 5 wherein the directed edges represent time intervals. 

1 19. The method of claim 1 8 wherein the time interval comprises the time period 

2 between tasks. 

1 20. The method of claim 1 8 wherein the time interval comprises the time period from 

2 beginning a task to ending a task. 
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The method of claim 1 8 wherein the time interval comprises a set time period. 

The method of claim 18 wherein the time interval comprises a time period 
between time periods. 

A method for executing a schedule of tasks in a multiple processor, distributed 
memory architecture, the method comprising: 

a) generating the schedule of tasks based at least in part on a task-level 
network of behaviors; 

b) calculating a demand function based at least in part on a constraint related 
to at least one of a plurality of tasks in the schedule of tasks; and 

c) allocating a task having highest priority to a processor having least cost 
according to the demand function. 

The method of claim 23 further comprising allocating a data block to a memory in 
the distributed memory. 

The method of claim 23 wherein the demand function is calculated based at least 
in part on the task-level network of behaviors. 

The method of claim 23 wherein the demand function is calculated based at least 
in part on an impact on the schedule of tasks. 

The method of claim 23 wherein the demand function is calculated based at least 
in part on an impact on data movement. 
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The method of claim 23 wherein the demand function is calculated based at least 
in part on prior allocation decisions. 

The method of claim 23 wherein the cost is defined as the least negative impact 
on at least one performance factor. 

The method of claim 29 wherein the at least one performance factor comprises the 
schedule of tasks. 

The method of claim 29 wherein the at least one performance factor comprises 
data movement. 

The method of claim 23 further comprising allocating a task having next highest 
priority to a processor having next least cost according to the demand function. 

The method of claim 23 further comprising recalculating the demand function in 
response to each task in the plurality of tasks being allocated to a processor. 

A compiler for mapping a plurality of tasks and data onto a multiple processor, 
distributed memory architecture, the compiler comprising: 

a) means for describing a task-level network of behaviors, each of the task- 
level network of behaviors being interrelated through control and data 
flow dependencies; 

b) means for predicting a schedule of tasks for the task-level network; and 

c) means for allocating the plurality tasks and data to at least one of the 
multiple processors and at least one of the distributed memories, 
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respectively, in response to the predicted schedule of tasks. 

The method of claim 34 further comprising means for producing machine 
executable code for the architecture based at least in part on the means for 
allocating the plurality of tasks and data. 

The method of claim 34 wherein the multiple processors communicate using the 
distributed memories. 
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